Explorer Scholarly Context Adrift : Three out of Four URI References Lead to Changed Content

نویسندگان

  • Shawn M. Jones
  • Herbert Van de Sompel
  • Harihar Shankar
  • Martin Klein
  • Richard Tobin
  • Claire Grover
چکیده

Increasingly, scholarly articles contain URI references to “web at large” resources including project web sites, scholarly wikis, ontologies, online debates, presentations, blogs, and videos. Authors reference such resources to provide essential context for the research they report on. A reader who visits a web at large resource by following a URI reference in an article, some time after its publication, is led to believe that the resource’s content is representative of what the author originally referenced. However, due to the dynamic nature of the web, that may very well not be the case. We reuse a dataset from a previous study in which several authors of this paper were involved, and investigate to what extent the textual content of web at large resources referenced in a vast collection of Science, Technology, and Medicine (STM) articles published between 1997 and 2012 has remained stable since the publication of the referencing article. We do so in a two-step approach that relies on various well-established similarity measures to compare textual content. In a first step, we use 19 web archives to find snapshots of referenced web at large resources that have textual content that is representative of the state of the resource around the time of publication of the referencing paper. We find that representative snapshots exist for about 30% of all URI references. In a second step, we compare the textual content of representative snapshots with that of their live web counterparts. We find that for over 75% of references the content has drifted away from what it was when referenced. These results raise significant concerns regarding the long term integrity of the web-based scholarly record and call for the deployment of techniques to combat these problems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scholarly Context Adrift: Three out of Four URI References Lead to Changed Content

Increasingly, scholarly articles contain URI references to "web at large" resources including project web sites, scholarly wikis, ontologies, online debates, presentations, blogs, and videos. Authors reference such resources to provide essential context for the research they report on. A reader who visits a web at large resource by following a URI reference in an article, some time after its pu...

متن کامل

Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot

The emergence of the web has fundamentally affected most aspects of information communication, including scholarly communication. The immediacy that characterizes publishing information to the web, as well as accessing it, allows for a dramatic increase in the speed of dissemination of scholarly knowledge. But, the transition from a paper-based to a web-based scholarly communication system also...

متن کامل

Simple algorithm to assess the diversity and distribution for algae of Iran

By studying algae reports from different localities of the country during the past 87 years, about 6000 records information including 2567 species and infra-specifics were analyzed and distribution maps were drawn. The algae studies in the Iranian islands of the Persian Gulf, especially around Bushehr and the Khark Island by Borgesen (1930), the oldest available document is used in this study. ...

متن کامل

Scholarly blogging practice as situated genre: an analytical framework based on genre theory

Introduction. Examines how an analytical framework of situated genre analysis can be used to study how research blogs are constructed and used as tools in scholarly communication. Method. A framework was extracted from genre research theories consisting of four concepts: aim, form, content and context. The term situated genre was used to focus on social practices. The context was further elabor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017